Identification of chromosomal abnormalities in miscarriages by CNV-Seq

Objective The primary object of this study is to analyze chromosomal abnormalities in miscarriages detected by copy number variants sequencing (CNV-Seq), establish potential pathways or genes related to miscarriages, and provide guidance for birth health in the following pregnancies. Methods This study enrolled 580 miscarriage cases with paired clinical information and chromosomal detection results analyzed by CNV-Seq. Further bioinformatic analyses were performed on validated pathogenic CNVs (pCNVs). Results Of 580 miscarriage cases, three were excluded as maternal cell contamination, 357 cases showed abnormal chromosomal results, and the remaining 220 were normal, with a positive detection rate of 61.87% (357/577). In the 357 miscarriage cases, 470 variants were discovered, of which 65.32% (307/470) were pathogenic. Among all variants detected, 251 were numerical chromosomal abnormalities, and 219 were structural abnormalities. With advanced maternal age, the proportion of numerical abnormalities increased, but the proportion of structural abnormalities decreased. Kyoto Encyclopedia of Genes and Genomes pathway and gene ontology analysis revealed that eleven pathways and 636 biological processes were enriched in pCNVs region genes. Protein–protein interaction analysis of 226 dosage-sensitive genes showed that TP53, CTNNB1, UBE3A, EP300, SOX2, ATM, and MECP2 might be significant in the development of miscarriages. Conclusion Our study provides evidence that chromosomal abnormalities contribute to miscarriages, and emphasizes the significance of microdeletions or duplications in causing miscarriages apart from numerical abnormalities. Essential genes found in pCNVs regions may account for miscarriages which need further validation. Supplementary Information The online version contains supplementary material available at 10.1186/s13039-024-00671-7.


Introduction
Miscarriage is spontaneous pregnancy loss that happens before 28 gestational weeks and can be divided into early miscarriage and late miscarriage according to a boundary of 12 gestational weeks.As reported, the incidence of miscarriage is about 15-20%, with an increasing trend year by year [1].About 1% of couples have experienced repeated pregnancy losses, mainly in early pregnancy [2].The causes of miscarriage are complex, and the recognized reasons mostly comprise genetic factors, maternal factors, and environmental factors [3,4].
Studies have revealed that chromosome aneuploidies and polyploidies are frequently detected in miscarriages, especially in early miscarriages, with an approximate prevalence of 50% [2,3].Further studies have also discovered copy number variants (CNVs) in miscarriages, and a systematic analysis has revealed the relationship between CNVs and miscarriages [5,6].CNVs refer to deletions or duplications of genome segments, some CNVs may contain numerous genes or overlay with regions of identified chromosomal disorders, but how CNVs result in miscarriages is still unclear.
Commonly, genetic analysis of abortion tissues mainly includes chromosome karyotype and chromosome microarray analysis (CMA).In recent years, with the advance of the next generation sequencing, CNV-Seq has been frequently applied in detecting chromosomal abnormalities clinically.CNV-Seq can find chromosome aneuploidies, polyploidies, microduplications, and microdeletions.Compared to karyotype and CMA, CNV-Seq is more high-resolution and more comprehensive in the detection of variants [7][8][9].In addition, the cost of CNV-Seq is lower than CMA.
In this study, we systematically evaluated the chromosomal abnormalities of early and middle trimester abortion by CNV-Seq.We clarified the relationship between chromosomal abnormalities and miscarriages.Moreover, we analyzed the critical regions of CNVs detected to identify potential candidate genes related to miscarriages and further performed functional gene analysis using gene enrichment and protein interaction analysis.This study may help to screen the genetic causes of aborted fetuses and provide reproductive guidance for women of childbearing age.

Subjects
580 spontaneous abortions that happened between 5 and 28 gestational weeks from January 2018 to December 2021 were enrolled in this study.The Medical Ethics Committee of Zhongnan Hospital of Wuhan University approved this study (Approval Number: 2023015K), and the written informed consent of all patients was obtained.

Detection of chromosomal abnormalities
10 mg products of concept (POCs) were collected for chromosomal abnormalities detection, and maternal peripheral blood was collected for maternal contamination exclusion.DNA was extracted by a Qiagen DNA Blood Midi/Mini kit (Qiagen GmbH, Hilden, Germany), the purity and yield of DNA products were determined using NanoDrop spectrophotometer and agarose gel electrophoresis.STR analysis was conducted by a commercial kit (Microread, Beijing, China) before CNV-Seq, selected STR analysis markers were D19S433, D5S818, D6S1043, AMEL, D3S1358, D7S820, D16S539, CSF1PO, Penta, D2S441, D8S1179, FGA, D2S1338 and D13S317.In the end, 577 samples went on CNV-Seq, and the other three were excluded for maternal contamination.The qualified DNA genome was randomly broken into an average size of 200 bp, and the library was constructed by terminal repair, connector connection, and other methods using the high-throughput sequencing library kit of Beijing Berry.The NextSeq 500 platform (Illumina, San Diego, USA) was used to perform massively parallel sequencing with a reading length of 36 bp and a reading depth of 0.5×, which was a single-end sequencing.GRCh37/hg19 was the reference sequence for further analysis, and only CNVs larger than 100 kb were reported.Database of Genomic Variants (DGV) (16), DECIPHER (https:// www.decip herge nomics.org/), OMIM (https:// omim.org/ ® ), and PubMed (https:// pubmed.ncbi.nlm.nih.gov/) were explored for CNVs mapping and identification.For further exploration of the effects of CNVs beyond aneuploidies and polyploidies in miscarriages, we classified the variants as either numerical or structural abnormalities.Aneuploidies and polyploidies were categorized as numerical abnormalities, whereas deletions or duplications were classified as structural abnormalities.According to the American College of Medical Genetics and Genomics (ACMG) technical standards, structural abnormalities were classified into five categories: pathogenic CNVs (pCNVs), likely pathogenic CNVs (lpCNVs), variants of uncertain significance (VOUS), likely benign CNVs (lbCNVs), and benign CNVs (bCNVs) [10].

Statistical analysis
Comparisons between groups were conducted using the chi-square test, which was performed in SPSS Statistics software v26.0 (IBM Corp, Armonk, NY).P < 0.05 was considered statistically significant.

Subject characteristics
A total of 580 abortion tissue samples were collected in this study, of which 3 samples were excluded due to maternal cell contamination, and CNV-Seq tested the remaining 577 samples (Fig. 1).The average gestational age at the time of CNV abnormal fetal abortion was 9.58 ± 2.93 weeks, ranging from 5 to 28 weeks.The average age of pregnant women was 31.17 ± 4.33 years old, ranging from 22 to 44 years old.269 (46.62%) pregnant women were younger than 30 years old, 207 (35.88%) were 30-35 years old, 72 (12.48%) were 35-40 years old, and 29 (5.03%) were older than 40 years old.

Comparison of CNVs among pregnancies of different maternal ages or gestational weeks
To analyze the association between various chromosomal abnormalities and maternal age (MA) in miscarriages, we divided pregnancies into 4 groups: MA < 30 y, 30 y ≤ MA < 35 y, 35 y ≤ MA < 40 y, and MA ≥ 40 y, the related kinds of variants were numerical chromosomal abnormalities and structural abnormalities (Table 2).For detailed analysis of structural abnormalities in different groups, variants were also grouped into pCNVs, lpCNVs, bCNVs, lbCNVs and VOUS, however, apart from pCNVs and VOUS the sizes of the remaining three groups were too small for effective statistical analyses.The fraction of abnormal chromosome numbers steadily rose with MA (Fig. 2C), with a significant statistical difference (χ 2 = 25.379,P < 0.05).The ratio of structural chromosomal abnormalities gradually decreased with the increasing age of pregnant women, with a statistically significant difference (χ 2 = 11.722,P < 0.05).
To analyze the relationship between different chromosomal abnormalities and gestation weeks (GW), we divided pregnancies into to groups: early gestation week (GW ≤ 12 w) and middle gestation week (12 w < GW ≤ 28 w), variants were grouped as above (Table 2).Most of miscarriages happened during early stage of gestation might be the result of chromosomal abnormalities.As for specific kinds of variants, such as pCNVs or VOUS, there was no difference between GW.

Potential candidate genes and signaling pathways analyzed in miscarriages
KEGG pathway analysis revealed that the genes located in pCNVs significantly clustered in the following eleven pathways (Fig. 3C

Table 1 Baseline characteristics of cases with chromosomal abnormalities
A Mean ± standard deviation B In this part, cases were grouped into cases with numerical abnormalities (aneuploidies, polyploidies and mosaicism) or cases with structural abnormalities (deletions or duplications).Structural abnormalities were classified as pCNVs, lpCNVs, bCNVs, lbCNVs and VOUS according to ACMG guidelines C Of the 357 cases detected with variants, 5 cases were detected with both numerical abnormalities and structural abnormalities, the 5 cases were represented twice in this table
There are 226 dosage-sensitive genes in 56 pCNVs (Additional file 1: Table S1).As shown in Fig. 3D, we analyzed the association among genes selected according to their betweenness.With higher betweenness, larger nodes were shown.PPI network analysis revealed seven hub genes in miscarriages as TP53, CTNNB1, UBE3A, EP300, SOX2, ATM, and MECP2.

Discussion
Miscarriage is a frequent adverse event in pregnancies for various reasons.The causes of miscarriages include embryonic genetic defects [14], maternal systemic diseases, genital abnormalities [15], endocrine abnormalities [16], immune abnormalities [17], fetal abnormalities, and some other factors [18].In this study, we analyzed and discussed the contribution of chromosomal abnormalities in miscarriages.CNV-Seq was the chosen technology for detecting variants.The further analysis explored the influences of maternal age and gestational weeks in detecting chromosomal abnormalities in miscarriages.Meanwhile, KEGG pathway analysis, GO analysis, and PPI analysis were used for bioinformatic analysis of genes in pCNVs, aiming to figure out potential occurrence mechanisms or candidate genes for miscarriages.
It's well-recognized that advanced maternal age is associated with miscarriage and fetal aneuploidy [28].With advanced maternal age, ovarian functions and egg qualities decline, which results in chromosome non-separation or aberration of germ cells and fertilized eggs during meiosis [29].Consistent with other studies, the proportion of numerical abnormalities rose as raised MA with statistical significance.As for structural abnormalities, a decreasing trend was found when maternal age increased, which also agreed with previous reports [30].However, some studies argued that the frequency of structural abnormalities seems to be unrelated to the maternal age [31], thus, large amounts of data are urgently needed for further analysis.In addition to advanced maternal age, our results also suggest that abortions occurring in early gestation might contribute to chromosomal abnormalities, in agreement with previous studies [30].
The contribution of numerical chromosomal abnormalities was discussed above, the following was the discussion of structural abnormalities.Of the 219 structural variants, 56 were pCNVs (11.91%, 56/470), and 146 (31.06%, 146/470) were VOUS.The detection rate of pCNVs (5.37%, 31/577) was consisted with reported at 2.2-13%, whereas the detection rate of VOUS was higher compared with other studies [32].We suspected that the inconsistent detection rate of VOUS might be a consequence of missing parental validation.Among 56 pCNVs, we found four confirmed regions related to miscarriages as 3q29 microduplication, 1p36 microdeletion, 5p deletion, and 8p23.1 deletion.In this cohort, 7q36.3 was frequent region (5/56, 8.83%) where pCNVs detected.As reported, deletion or duplication of 7q36.3 may result in developmental delay and congenital heart disease [33].There are 16 protein-coding genes located in this region,

Table 2 Statistics of cases with chromosomal abnormalities in various maternal age groups and gestational age groups
A In the group of MA < 30 y, 1 case was detected with both numerical and structural abnormalities, the structural abnormalities in this case detected were classified as pCNVs, this case was represented three times in this table.The number in the bracket represents the case mentioned, the same in below B In the group of 30 ≤ MA < 35 y, 3 cases were detected with both numerical and structural abnormalities, the structural abnormalities detected in these cases were classified as pCNVs, these cases were represented three times in this table C In the group of 35 ≤ MA < 40 y, 1 case was detected with both numerical and structural abnormalities, the structural abnormalities in this case detected were classified as pCNVs, this case was represented three times in this table D In the group of GW ≤ 12 w, 5 cases were detected with both numerical and structural abnormalities

Characters
Maternal age (MA) Gestational weeks (GW) among them, expression of the SHH gene was confirmed as impaired in the villus of recurrent miscarriages which means that dysfunction of SHH might link to miscarriages [34].We surfed previous studies, finding that deletions or duplications in 7q36.3 were detected in several POCs [6,35,36].Taken together, we hypothesized that 7q36.3 might be a candidate region related to miscarriages, for which more evidence is urged to indicate the association.
It's still a mystery how CNVs lead to miscarriage, although an increasing number of CNVs were analyzed to be associated with miscarriage.According to bioinformatic analysis as KEGG and GO, the genes performed biological processes like embryonic organ development, vasculogenesis, synapse organization, and histone modification.The following KEGG pathways were accurately related to miscarriages: MAPK signaling pathway, NF-κB signaling pathway, focal adhesion, and HIF-1 signaling pathway.Activating the p38/MAPK signaling pathway inhibits trophoblast proliferation, migration, and epithelial-mesenchymal transition (EMT), resulting in miscarriage [37].Whereas blocking MAPK/ERK1/2 signaling pathway may inhibit the growth and invasiveness of human trophoblasts [38].Inhibition of the NF-κB pathway might eliminate the invasiveness of trophoblasts [39].Moreover, over-activation of NF-κB was observed in early spontaneous miscarriages [40].HIF-1α/VEGF signaling pathway participates in villous angiogenesis, which is an important event for the development of the placenta or other embryonic organs [41].A deficiency of HIF-1α may result in fetal heart defects or increase the abortion rate [42].Our findings were generally in line with previous research, but we did not found genes linked to adherence junction pathway that Wu et al. [21] found to be most enriched in their cohort.This may be due to the differences among samples, more samples and systematic Fig. 3 GO, KEGG analysis and PPI network of selected genes.(A and B) GO analysis of 226 genes (P < 0.05).(C) Eleven enriched pathways in KEGG analysis (P < 0.05).(D) TP53, CTNNB1, UBE3A, EP300, SOX2 and ATM were selected to be significant genes in PPI network analysis are needed to comprehensively explain the potential mechanism of CNV-related miscarriages.Further, we selected seven genes (TP53, CTNNB1, UBE3A, EP300, SOX2, ATM, MECP2) according to their betweenness with other genes in the PPI network.TP53 is a vital gene in the pathways of cancers, which functions as a negative regulator to the proliferation of tumors.Summarizing evidence indicates that TP53 is related to miscarriages [43].CTNNB1 is an essential participant in the Wnt/β-catenin pathway and a candidate in early embryonic development [44].UBE3A is an imprinted gene related to Angelman syndrome.EP300, a member of transcriptional co-activator genes, acts importantly in multiple cellular processes, including embryo development.Inhibition of Ep300 may induce congenital disabilities and affect gender determination in the embryo [45].SOX2 is an essential transcriptional factor for cell differentiation, development, and sexual determination expressed in embryonic stem cells; SOX2 ablation would cause embryonic lethal after implantation [46].ATM is crucial for DNA repair and apoptosis, and Atm KD/ KD mice died in utero at E9.5 [47].Zhang et al. [48] discovered that ATM phosphorylates Thr145 and Thr156, two critical resides for the KH domain containing 3like (KHDC3L) which is a new miscarriage risk candidate.MECP2 plays a crucial role in interpreting epigenetic signatures that command chromatin conformation and regulation of gene transcription.As reported, mutations or increased dosages of the MECP2 gene may contribute to Rett Syndrome, which is lethal in the male embryo [49].Moreover, overexpression of Mecp2 in mice leads to lethal cardiac and skeletal malformations [50].Misregulation or malfunction of these genes may impair the development or implantation of an embryo, interfering with normal biological processes that might result in miscarriage.The chosen seven genes are somewhat associated with miscarriages, however, the precise relationship between CNVs and miscarriages is yet unclear mainly as the following two reasons.First, most CNVs cover multiple coding genes, thus, it is not easy to pinpoint the causative genes.Second, the changes in gene expression and expression mechanisms are complicated [51].To clarify the association, basic experiments need to be carried out for functional validation of pathways and genes selected.
According the findings above, it is evident that CNV-Seq is a useful high-resolution technology.In comparison to CMA, which relies on specific probes, CNV-Seq covers the entire genome, resulting in a larger detection region, and it is more cost-effective and convenient.However, it has limitations, as CNV-Seq cannot detect balanced translocation or uniparental disomy (UPD), both of which can be identified by CMA.Considering its high resolution for variants detection, minimal DNA sample requirements, and cost-effectiveness, CNV-Seq may be the preferred choice for clinical use in prenatal diagnosis or miscarriages.It is important to note that while our study indicated that CNVs were detected in most of miscarriages, the origins of variants were not clarified, representing a limitation of this study.In cases where couples are carriers of balanced chromosomal translocation, pregnancies may be lost due to the occurrence of abnormal chromosomes.As chromosomal abnormalities can be inherited from parents or occur de novo, it is essential to find out the origin as a crucial step for future pregnancies.

Conclusion
To summarize, chromosomal abnormalities were found in most miscarriage cases recruited in this study, particularly in the early abortion group, moreover, 7q36.3 was thought to be a possible CNV for miscarriages.Maternal age was validated as an independent factor of fetal chromosomal abnormalities in miscarriages.Our findings revealed that advanced MA populations have a high risk of aneuploidy, which should be mentioned during eugenic counseling.Still, the relationship between structural chromosomal abnormalities and MA needs further investigation.Clinicians should pay attention to pCNVs and VOUS detected in miscarriages and should be aware of parental sources seeking for CNVs.Bioinformatic analysis indicated participant pathways and central genes of CNVs identified in miscarriages, further functional studies are needed for validation.

Fig. 2
Fig. 2 Distribution of chromosomal abnormalities.(A) General classification of variants with respective proportions.(B) The number of aneuploid and P CNVs in different chromosomes.(C) Distribution of cases with positive or negative chromosomal results in various maternal age groups.(D) The bar plot showed the distribution of early or late miscarriage in various maternal age groups and the line chart showed the trends of chromosomal abnormalities detected in different gestational weeks as the maternal age increased